Applying a Data Miner To Heterogeneous Schema Integration
نویسندگان
چکیده
An application of data mining techniques to heterogeneous database schema integration is introduced. We use attribute-oriented induction to mine for characteristic and classification rules about individual attributes from heterogeneous databases. Each mining request is conditioned on a subset of attributes identified as "common" between the multiple databases. We develop a method to compare the rules for two or more attributes (from different databases) and use the similarity between the rules as a basis to suggest similarity between attributes. As a result, we use relationships between and among entire sets of attributes from multiple databases to drive the schema integration process. Our initial efforts and prototypes applying data mining to assist schema integration prove promising and, we feel, identify a fruitful application area for data mining research. goywords : schema integration, multi-database interrelationships, attribute similarity, data mining, attribute-oriented induction.
منابع مشابه
OWL as Yet Another Data Model to be Integrated
The paper argues against cultivation in the ontological community of the opinion that ontologies are at the "semantic" level, whereas database schema are models of data at the "logical" or "physical" level. The paper claims that rather it would be right to consider OWL as yet another data model to be integrated with other heterogeneous information models. Applying the SYNTHESIS – an extensible ...
متن کاملInvestigating a heterogeneous data integration approach for data warehousing
Data warehouses integrate data from remote, heterogeneous, autonomous data sources into a materialised central database. The heterogeneity of these data sources has two aspects, data expressed in different data models, called model heterogeneity, and data expressed within different schemas of the same data model, called schema heterogeneity. AutoMed is an approach to heterogeneous data transfor...
متن کاملSchema Evolution in Data Warehousing Environments - A Schema Transformation-Based Approach
In heterogeneous data warehousing environments, autonomous data sources are integrated into a materialised integrated database. The schemas of the data sources and the integrated database may be expressed in different modelling languages. It is possible for either the data source schemas or the warehouse schema to evolve. This evolution may include evolution of the schema, or evolution of the m...
متن کاملIntegration of Heterogeneous Object
In a heterogeneous database system which consists of object databases, a global schema created by integrating schemas of the component databases can provide a uniform interface and high level location transparency for the users to retrieve data. The main problem for constructing a global schema is to resolve connicts among component schemas. In this paper, we deene corresponding assertions for ...
متن کاملAn Improved Semantic Schema Matching Approach
Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...
متن کامل